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Abstract 

We give a short proof that the largest component Ci of the random 
graph G(n, 1/n) is of size approximately . The proof gives explicit 
bounds for the probability that the ratio is very large or very small. In 
particular, the probability that exceeds A is at most 

for some c > 0. 


1 Introduction 

The random graph G{n,p) is obtained from the complete graph on n vertices, 
by independently retaining each edge with probability p and deleting it with 
probability 1 — p. Erdos and Renyi [8] introduced this model in 1960, and 
discovered that as c grows, G{n,c/n) exhibits a double jump: the cardinality 
of the largest component Ci is of order log n for c < 1 , of order for c = 1 
and linear in n for c > 1. In fact, for the critical case c = 1 the argument in 
[8] only established the lower bound on P(|Ci| > for some constant 

A > 0 ; the upper bound was proved much later in [ 1 ] and m- 

Short proofs of the results stated above for the noncritical cases c < 1 
and c > 1 can be found in the books [2], 0, and m- However, we could not 
find in the literature a short and self-contained analysis of the case c = 1. 
We prove the following two theorems: 

Theorem 1 (see [16] and [17] for similar estimates) LetCi denote the 
largest component of G{n, 1/n), and let G{v) he the component that contains 
a vertex v. For any n > 1000 and A > 8 we have 

P(|C'(u)| > Ar?!'^') < 32 ^ , 
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and 


n/Q 4 a2(A-4) 

P(|Ci| > An^/^) < -e-W-^ . 

Theorem 2 For any 0 < <5 < 1/10 and n > 200/(5^/®, the random graph 
G(n, 1/n) satisfies 

P(^|Ci| < [< 5 n 2 / 3 j^ < 1553/5 _ 

While the estimates in these two theorems are not optimal, they are 
explicit, so the theorems say something about G(n, 1/n) for n = 10® and 
not just as n ^ 00 . The theorems can be extended to the “critical window” 
p = l/n+An“^/3^ gee Section [6l As noted above, Erdos and R&yi [8] proved 
a version of Theorem [2l their argument was based on counting tree com¬ 
ponents of G(n, 1/n). However, to prove Theorem [T] by a similar counting 
argument requires consideration of subgraphs that are not trees. Indeed, 
with such considerations, Pittel m proves tail bounds on n that 

are asymptotically more precise than Theorems [T] and [2j For a probabilis¬ 
tic approach to Theorem [T] that does not use martingales, see Scott and 
Sorkin [T7] . 

The systematic study of the phase transition in G{n,p) around the point 
p ~ 1/n was initiated by Bollobas [1] in 1984 and an upper bound of order 
n2/3 £qj. median (or any quantile) of \Ci\ was first proved by Luczak [12j . 
Luczak, Pittel and Wierman m subsequently proved the following more 
precise result. 

Theorem 3 (Luczak, Pittel and Wierman 1994) Let p = ^ + 

where A G M is fixed. Then for any integer m > 0, the sequence 

(n-2/3|Ci|,n-2/3|C2|,...,n-®/3|C,„|) 
eonverges in distribution to a random veetor with positive components. 

The proofs in m, US] and [E] are quite involved, and use the detailed 
asymptotics from m, m and [3] for the number of graphs on k vertices 
with k + i edges. Aldous [T] gave a more conceptual proof of Theorem [3] us¬ 
ing diffusion approximation, and identified the limiting distribution in terms 
of excursion lengths of reflected Brownian motion with variable drift. The 
argument in [T] is beautiful but not elementary, and it seems hard to ex¬ 
tract from it explicit estimates for specific finite n. A powerful approach, 
that works in the more general setting of percolation on certain finite tran¬ 
sitive graphs, was recently developed in [6]. This work is based on the lace 
expansion, and is quite difficult. 
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Our proofs of Theorems [T] and [2] use an exploration process introduced 
in |14j and im, and the following classical theorem (see, e.g. [7] section 4, 

or [T8]i. 

Theorem 4 (Optional stopping theorem) Let {Xt}t>o be a martingale 
for the increasing a-fields {Xt} o,nd suppose that ti,T 2 are stopping times 
with 0 < Ti < r 2 . If the process {-^tAT 2 }t>o 'Is bounded, then 

Remark. If {Xt}t>o is a submartingale (supermartingale), then under un¬ 
der the same boundedness condition, we have E Xr^ < E X^^ (respectively, 
EX^j > EX^J. 

The rest of the paper is organized as follows. In Section [2] we present the 
exploration process mentioned above. In Section [3] we present a very simple 
proof of the fact that in G{n,l/n) we have P(|Ci| > 

The proof of Theorem [1] and [2] are then presented in Sections S] and [5l 
The technical modifications required to handle the “critical window” p = 
1/n -|- are presented in Section [6l 

2 The exploration process 

For a vertex v, let C{v) denote the connected component that contains v. 
We recall an exploration process, developed independently by Martin-Lof 
m and Karp m- In this process, vertices will be either active, explored or 
neutral. At each time tG{0,l,...,n}, the number of active vertices will be 
denoted 1) and the number of explored vertices will bet. Fix an ordering of 
the vertices, with v first. At time t = 0, the vertex v is active and all other 
vertices are neutral, so Yq = I. In step t G {1,..., n}, if l)_i > 0 let wt be 
the first active vertex; if l)_i = 0, let wt be the first neutral vertex. Denote 
by r]t the number of neutral neighbors of wt in G(n, 1/n), and change the 
status of these vertices to active. Then, set wt itself explored. 

Write Nt = n — Yt — t — l|y^=o}' Given Yi,... the random variable 

r]t is distributed Bin(X 4 _i, 1/n), and we have the recursion 



Yt-i r/t - 1, Yt-i > 0 

Tit, Yt-i = 0. 


( 1 ) 


At time r = minjt > 1 : Y) = 0} the set of explored vertices is precisely 


C{v), so |C(u)| = r. 


To prove Theorem [H we will couple {Y/} to a random walk with shifted 
binomial increments. We will need the following lemma concerning the 
overshoots of such walks. 
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Lemma 5 Letp G (0,1) and bei.i.d. random variables with Bm{n,p) 

distribution and let St = 1 + “ ^)- integer H > 0, and define 

7 = min{t >1 : St > H oi St = 0} . 

Let E CN be a set of positive integers. Given the event {S^ > H, 'y G E}, the 
conditional distribution of the overshoot S.y — H is stochastically dominated 
by the binomial distribution Bin(n,p). 

Proof. First observe that if ^ has a Bin(n,p) distribution, then the con¬ 
ditional distribution of ^ — r given > r is stochastically dominated by 
Bin(n,p). To see this, write ^ as a sum of n indicator random variables 
and let J be the minimal index such that Given 

J, the conditional distribution of — r is Bin(n — J,p) which is certainly 
dominated by Bin(n,p). 

For any t G E, conditioned on {7 = £} n {S'f-i = H — r} fi {5.y > i?}, 
the overshoot S.y — H equals — r where has a Bin(n,p) distribution 
conditioned on > r. The assertion of the lemma follows by averaging. □ 

Corollary 6 Let X be distributed Bin(n,p) and let f be an increasing real 
function. With the notation of the previous lemma, we have 

E [f{S.y -H)\S^>H,^gE] < E/(X) . 


3 An Easy Upper Bound 


Fix a vertex v. To analyze the component of v in G(n, 1/n), we use the 
notation established in the previous section. We can couple the sequence 
{Vt}t>i constructed there, to a sequence of i.i.d. Bin(n, 1/n) random 

variables, such that > ht for all t < n. The random walk {St} defined 
in Lemma [5] satisfies St = St-i -|- — 1 for all t > 1 and Sq = 1. Fix an 
integer H > 0 and define 7 as in Lemma [5l Couple St and T) such that 
St > Yt for all t < 7 . Since {St} is a martingale, optional stopping gives 
1 = E [S'.y] > H'P{S.y > H) , whence 

Y{S^>H)<^. ( 2 ) 

Write Sf^ = H‘^ + 2H{S^ — H) + {Sy — Hfi and apply Corollary [ 6 ] with 
f{x) = 2Hx + to get for Ff > 2 that 


E 



Sy>H 


< + 2H+ 2 < + 3H . 


(3) 
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Now Sf — (1 — ^)t is also a martingale. By optional stopping, ([2]) and ([3|), 


1 + (1 - -)E7 = E (5^) = P{S^ > H)E 
n ' 




< iiT + 3 , 


hence we have for 2 < H < n — 3 that 

E-f<H + 3. (4) 

We conclude that for 2 < H < n — 3 

Define 7 * = 7 A , and so by the previous inequality and ([2]) we have 

Pis^^ > 0) < P{S^ >H) + P(7 > ^ 2 ) < 4 • (5) 

ri 

Let T = and note that if | (7(^)1 > we must have S^* > 0 so by 
([5]) we deduce P(|C'(u)| > T) < for all 9 < T < (n — 3)^. Denote by Nt 
the number of vertices contained in components larger than T. Then 


> T < P 


P(|Ci| 

Putting T = (^ [V A'n?/^\ ^ for any A> 1 yields 


^ ^ ElVr ^ nPi\Civ)\>T) 


> 


< P(^|Ci| 


> T < 


3n 


3 — 


as 




2/3 1 > 


Av?/^ 

- > ny43/2(l - > 


^ 3/2 ’ 


A3/2r 


□ 


4 Proof of Theorem [T] 

We proceed from ([5]). Define the process {Zt} by 

t 

Zt = - !)• 

i=i 


( 6 ) 


The law of is stochastically dominated by a Bin(n —j, distribution, 
for j < n. Hence, 


E 


e^K*+j-i) I 


< e 


l + -(e=-l) 
n 


^ I „2\n-j 


ic+C^)^-C < 
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as e'^ — 1 < c + for any c £ (0,1) and 1 + x < for x > 0. Since this 
bound is uniform in S^* and 7 *, we have 


E 


„cZt 




< 


± z, _ Cl 

e 2 n 


Write P 5 for the conditional probability given S-f*. Then for any c £ (0,1), 
we have 

Ps[Zt > -5^*) < Ps(e"^* > 6 "'=^^*) < . 

By ([H), if Ey*+j > 0 for all 0 < j < t — 1 , then Zj = Ty*+j — Yy* for all 
1 < j < t. It follows that 


P(^Vj < t Yy*^j > 0 I Sy* > Oj < E FsiZt > —Syjf) \ Syif > 0 

< e*"'-^E[e"V I 5^* > 0 ], 

By Corollary [ 6 ] with H = {1,..., H^}, we have that for c £ (0,1), 


E \'y<H'^,Sy> 0 ]< e 


cH+c+c^ 


(7) 

( 8 ) 


Since {Sy* > 0} = {7 > H^} U {7 < Sy > 0} (a disjoint union), the 
conditional expectation E \ Sy* > 0] is a weighted average of the 

conditional expectation in ([8|) and of E [gcS^* I ^ > ^ 2 j ^ ^cH ^ Therefore 
E[e'^^'i* I Sy* > 0] < whence by ([7]), 

p(vj < t Yy*+j > 0 I > 0 ) < . (9) 


By our coupling, for any integer T > H^, if | (7(^)1 > T then we must have 
Sy* > 0 and Yy*^j > 0 for all j £ [0, T — H"^]. Thus, by ([5]) and ([9]), we have 


P(|C(u)| > T) < P(S’^* > 0)p(yj £ [0,r - H^] Yy*+j > O | > o) 

( 10 ) 


- H 


-+cH+c+c‘‘ 


Take H = and T = for some ^4 > 4; substituting c which 

attains the minimum of the parabola in the exponent of the right hand side 
of (fTO)) gives 


P(|C(u)| > An^/^) < 4re“^/\ mt-h^+d 


< 4?7,~^/^e 4(A-i+2n“'-/3+n“2/3) ^ 4n~^'^^e 


4(A-l/2) 
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since — 2n and n > 1000. As [(A —2)^/2 —2]^ = A^(^/2 — 

2)^ and (A/2 — 2)/{A — 1/2) > 1/4 for A > 8 we get 

P(|C(n)| > Ar?/^) < . 

Denote by Nt the number of vertices contained in components larger 
than T. Then 

p(|Ci|>r) < p(|iVT| > r) < ^ < , 

and we conclude that for all A > 8 and n > 1000, 

O/Q 4 A^iA-i) 

P(|Ci| > Ar?/^) < -e-W-^ . 

□ 


5 Proof of Theorem [2] 

Let h, Ti and T 2 be positive integers, to be specified later. The proof is 
divided in two stages. In the first, we ensure, with high probability, ascent 
of {Yf} to height h by time Ti. In the second stage we show that T) is likely 
to remain positive for T 2 steps. 

Stage 1: Ascent to height h. Dehne 


Th = min{t < Ti : Yt > h} 


if this set is nonempty, and Th = Ti otherwise. If l/_i > 0, then y/ — Y^_i = 
{rjt — I)^ + 2{rjt — l)Yt-i. Recall that rjt is distributed as Bin(At_i, 1/n) 
conditioned on lt_i, and hence if we also require l/_i < h then 


E 


F/ - Y,U I Yt., 


n n n 


Next, we require that h < y/njA and t <Ti = whence 


E 


^2 w2 


Yt-1 


1 

> - 
“ 2 


( 11 ) 


as long as 0 < lt_i < h. Similarly, (fTTIl holds if L)_i = 0. Thus — {t A 
Th)j2 is a submartingale. The proof of Lemma [5] implies that conditional on 
> h, the overshoot 1/-^ — h is stochastically dominated by a Bin(n, 1/n) 
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variable. So, apply Corollary [6] as in ([3|) with f{x) = 2hx + to get that 
+ 3/i < 2h^ for h > 3. By optional stopping, 


2h‘^ > BY? 

‘ h 


> -Exh > —P 

_ 2 ft - 2 


Th 



so 



_ \ 4/^2 32h^ 

Ti)< — < - 

/ Ti n 


( 12 ) 


Stage 2: Keeping Yt positive for T 2 steps. 

Define tq = min{s : Y^^^s = 0} if this set is nonempty, and tq = T 2 
otherwise. Let Mg = h — min{/i, If 0 < Mg-i < h, then 

m2 - m2_i < - 1)2 + 2(1 - rjr^+g)Mg_i , 


so provided h < ^ and s < T 2 < 
we have E Mg - Mg_^ \ Yrf^+g-i,Th 


8h’ 

< 


and recalling that t/j < Ti = 

2 . This also holds if Tr/j+s-i > h, 


so — 2(s A ro)}^Lo i® ^ super martingale. Write P/^ for conditional 

probability given the event > h} and Eh for conditional expectation 
given that event. Since {M2^.^^ — 2(sAro)}jLQ is a supermartingale beginning 
at 0 under E h, optional stopping yields 


^hM^oA.T 2 — 2E/i[to A T 2 ] < 2 T 2 , 


(13) 

whence 



Ph{ro < T 2 ) < P/^(m,oAT2 >h)< 

VI 

(14) 

In conjunction with (I12|]. this yields 



/ \ / \ / \ 32h? 

2 T 2 

(15) 

P(^'T'0 < 2'2j < E^Th — lij + EF/i^To < 'I'2j S ^ 
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Let T 2 = and choose h to approximately minimize the right-hand 

side of (fT^ . This gives h = which satisfies T 2 < -^ and makes 

the right-hand side of ([TH]) less than Since \Ci\ < T 2 implies tq <T 2 , 

this concludes the proof. □ 


6 The Critical Window 


As noted in the introduction, the proofs of Theorems[T]and[2]can be extended 
to the critical ’’window” p = —— for some constant A. For Theorem [2] 

this adaptation is straightforward, and we omit it. However, our proof of 
Theorem [T] used the fact that for A = 0 (that is, p = 1/n) the exploration 
process is stochastically dominated by a mean zero random walk, so we 
include the necessary adaptation below. 


Theorem 7 Set p = for some A G M and consider G{n,p). For 

A > 0 and A > 2X + 3 we have that for large enough n 


P(|C(u)| > An2/3) < + 16)n-V3e- 


and 


P(|Ci| > An2/3) < 


4A 


.A(l-e-4^) A 
For A < 0 and A > 3 we have that for large enough n 


16\ ((A-1)2/2-(A-1)A-2)2 

H—r e 


_OX 1 

—7-h min(5, — 

e — 1 A 


_ / ((A-1)T2-(A-1)A-2)2 

n ' e 4a 


and 


P(|Ci| > An2/3) < 



-|- min(5, 


1 \ ((A-1)T2-(A-1)A-2)^ 


Proof. Assume p = l/n+\n~‘^^^ and that n is large enough; again we bound 
the exploration process with a process {St} defined by St = St-i -|- — 1 

where f,t are distributed as Bin(n,p) and Sq = 1. The two cases of A being 
positive or negative are dealt with separately; assume first A > 0. Since 
1 — < a — a^/3 for small enough a > 0, we have 

Ee-«(«*-i) =e“[l-p(l-e-“)]^ >e“(l-p(a-aV3)r. 
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By Taylor expansion of log(l — x), for small a we have 

logE > a + — p(a — a^/3) + 

= o — (1 + An“^/^)(a — a^/3) + 0(n“^), 

and so for a = 4An“^/^ and n large, we have E e 1 ) > 1 hence {e 
is a submartingale. Take H = and define 7 as in Lemma [5j Then 

by optional stopping we have 

e““ < 1 - P{S^ >H) + P{Sy > , 

and as 1 — e““ < o for a > 0 we get 


4 A 77 

P (^7 >H)< . (16) 

Also, observe that St — is a martingale, hence by optional stopping 

1 + An“^/^E 7 = P(5'-j. > H)P | 5"-^ > Lf] and so by Corollary [ 6 ] we get 
E 7 < ■ For A > 1/4, as (1— 6 “^'*')“^ < 2, this gives that E 7 < 16n^/^. 

It is immediate to check that S'/ — is a submartingale as long as t < 7 , 
hence by optional stopping ^ E [S/1 S-y > H], Using Corollary [ 6 ] 

as in ([3]) and estimating < 2 for a: e (0,1/4] gives the same estimate 

for A G ( 0 ,1/4], Thus 

E7<16n^/^ (17) 

for all A > 0. Take again 7 * = 7 A and as in ([5]) by (11611 and (I17p we get 

P(S.,* > 0 ) < + 16)n-'/3 _ (18^ 


Define Zt as in ([6l) and note that this time its increments can be stochas¬ 
tically dominated by variables distributed as Bin(n—j,p) —1. Similar compu¬ 
tations to the one made in the beginning of Section 0] give that for c G (0,1) 


E 




^ gCtAn-l/3_^+c2t(i+An-l/3) 


and so as before we have 
P^Vj < t Y^*^j > 0 I S^* > 0^ 


< E 


Ps{Zt > —S'7*) I > 0 


< 


< 


^ct\n l/3_^+c2t(l+An I S^* >0] 

^ctXn~^— ^—+c^t{l+\n~^/^)+c{n^+l)+2(c+c'^) 
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where the last inequality is due to Corollary [ 6 l Write t = for some 

constant B and take c £ (0,1) which attains the minimum of the parabola 
in the exponent of the last display. This gives that for large enough n and 
fixed S > 2A + 2 we have 


/ < t > 0 I S^* > 0 I < e 


4(S+1) 


Together with (1181) , as in the proof of Theorem [H we conclude that for any 
T > 2A + 3 we have 


P(|C'(u)| > < 


4A 


. 1 — 

and as before this implies that 

P(|Ci| > ^n2/3) < ^ 


+ 16 In 




+ 


16\ ((A-l)^/2-(A-l)X-2)^ 


- e 


A) 

Assume now p = 1/n + for some fixed A < 0. For a > 0, as 

1 + X < we have 


Ega«t-1) ^ 


l+p(e“-l) 




By Taylor expansion of — 1 we have 


logEe“^^‘ a + (l + An + ^ + 0(o^)), 

and so for a = — An“^/^ > 0 we have that < 1 hence {e“'^*} is a 

supermartingale. With the same H and 7 as before, optional stopping gives 

e“ > 1 - F{S^ >H)+ P{S^ > , 

and as — 1 < 2 x for x small enough we get 

Also, as 7 is bounded above by the hitting time of 0, Wald’s Lemma (see 
[7]) implies that E 7 < —n^/^/A. For A £ [~|)0] it is straight forward to 
verify that A 7 ) is a submartingale, hence as before we deduce by 

optional stopping that E 7 < 5n^/^ for such A’s. Thus we deduce that for 
all A < 0, 

17 < min • 

The rest of the proof continues from (1171) . as in the case of A > 0. □ 


E' 


Remark. Using similar methods, in m, we analyze component sizes of 
bond percolation on random regular graphs. 
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